Overview
Dataset statistics
| Number of variables | 19 |
|---|---|
| Number of observations | 18575 |
| Missing cells | 19 |
| Missing cells (%) | < 0.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 12.0 MiB |
| Average record size in memory | 679.8 B |
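The missing-cell percentage follows from the counts in this table. A minimal sketch of the arithmetic, assuming the percentage is taken over all cells (variables × observations); the variable names are illustrative and not part of the report:

```python
# Values copied from the dataset statistics table.
n_variables = 19
n_observations = 18575
missing_cells = 19

# Assumption: the percentage is relative to the total number of cells.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.4f}%")
```

The result is far below 0.1%, which is consistent with the report showing "< 0.1%" rather than a number.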
Variable types
| Text | 2 |
|---|---|
| DateTime | 2 |
| Numeric | 7 |
| Categorical | 8 |
Alerts
| Alert | Type |
|---|---|
| cubeCapacity is highly overall correlated with cylinder and 3 other fields | High correlation |
| cylinder is highly overall correlated with cubeCapacity and 3 other fields | High correlation |
| doorNumber is highly overall correlated with type | High correlation |
| fuel is highly overall correlated with cubeCapacity and 1 other field | High correlation |
| powerHP is highly overall correlated with cubeCapacity and 3 other fields | High correlation |
| powerKW is highly overall correlated with cubeCapacity and 3 other fields | High correlation |
| targetPrice is highly overall correlated with powerHP and 2 other fields | High correlation |
| type is highly overall correlated with doorNumber | High correlation |
| yearIntroduced is highly overall correlated with targetPrice | High correlation |
| doorNumber is highly imbalanced (56.4%) | Imbalance |
| transmission is highly imbalanced (50.0%) | Imbalance |
| vehicleID has unique values | Unique |
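The report does not say how the imbalance percentages are computed. The two figures are consistent with one minus the normalised Shannon entropy of the category counts. The sketch below works under that assumption, and `imbalance_score` is a hypothetical helper rather than a function of the profiling library. The counts are taken from the doorNumber and transmission sections of this report.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log(k), where H is the Shannon entropy of the counts."""
    total = sum(counts)
    k = len(counts)
    if k < 2 or total == 0:
        return 0.0
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log(k)

door_number = [14601, 1922, 1882, 155, 15]   # counts for 5, 3, 4, 2 and 6 doors
transmission = [16532, 2043]                 # Manual, Automatic

print(f"doorNumber:   {imbalance_score(door_number):.1%}")
print(f"transmission: {imbalance_score(transmission):.1%}")
```

A score of 0% would mean all categories are equally frequent; a score near 100% would mean one category holds almost every row.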
Reproduction
| Analysis started | 2025-10-13 19:34:05.465613 |
|---|---|
| Analysis finished | 2025-10-13 19:34:10.666419 |
| Duration | 5.2 seconds |
| Software version | ydata-profiling v4.17.0 |
Variables
vehicleID
Text
Unique
| Distinct | 18575 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.0 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 9.4023149 |
| Min length | 6 |
Unique
| Unique | 18575 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | V_1232 |
|---|---|
| 2nd row | V_1233 |
| 3rd row | V_1234 |
| 4th row | V_1235 |
| 5th row | V_1236 |
| Value | Count | Frequency (%) |
| v_12317 | 1 | < 0.1% |
| v_12318576 | 1 | < 0.1% |
| v_1232 | 1 | < 0.1% |
| v_1233 | 1 | < 0.1% |
| v_1234 | 1 | < 0.1% |
| v_1235 | 1 | < 0.1% |
| v_1236 | 1 | < 0.1% |
| v_1237 | 1 | < 0.1% |
| v_1238 | 1 | < 0.1% |
| v_1239 | 1 | < 0.1% |
| Other values (18565) | 18565 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 34769 | |
| 2 | 26193 | |
| 3 | 26193 | |
| _ | 18575 | |
| V | 18575 | |
| 4 | 7618 | 4.4% |
| 5 | 7595 | 4.3% |
| 6 | 7518 | 4.3% |
| 7 | 7514 | 4.3% |
| 8 | 7084 | 4.1% |
| Other values (2) | 13014 | 7.5% |
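Several rows in the character table have an empty frequency cell. These can be recomputed from the counts. The sketch below assumes the denominator is the total of 174648 characters reported for this variable, and that the mean length is that total divided by the number of rows; the variable names are illustrative.

```python
# Counts copied from the "Most occurring characters" table for vehicleID.
char_counts = {"1": 34769, "2": 26193, "3": 26193, "_": 18575, "V": 18575}
total_chars = 174648   # total character count reported for this variable
n_rows = 18575         # number of observations

# Frequency of each character as a share of all characters.
for char, count in char_counts.items():
    print(f"{char!r}: {100 * count / total_chars:.1f}%")

# Mean string length, assuming it is total characters divided by rows.
mean_length = total_chars / n_rows
print(f"mean length: {mean_length:.7f}")
```

The same arithmetic reproduces the 4.4% listed for the character 4 (7618 / 174648), which supports the choice of denominator.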
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 174648 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 34769 | |
| 2 | 26193 | |
| 3 | 26193 | |
| _ | 18575 | |
| V | 18575 | |
| 4 | 7618 | 4.4% |
| 5 | 7595 | 4.3% |
| 6 | 7518 | 4.3% |
| 7 | 7514 | 4.3% |
| 8 | 7084 | 4.1% |
| Other values (2) | 13014 | 7.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 174648 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 34769 | |
| 2 | 26193 | |
| 3 | 26193 | |
| _ | 18575 | |
| V | 18575 | |
| 4 | 7618 | 4.4% |
| 5 | 7595 | 4.3% |
| 6 | 7518 | 4.3% |
| 7 | 7514 | 4.3% |
| 8 | 7084 | 4.1% |
| Other values (2) | 13014 | 7.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 174648 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 34769 | |
| 2 | 26193 | |
| 3 | 26193 | |
| _ | 18575 | |
| V | 18575 | |
| 4 | 7618 | 4.4% |
| 5 | 7595 | 4.3% |
| 6 | 7518 | 4.3% |
| 7 | 7514 | 4.3% |
| 8 | 7084 | 4.1% |
| Other values (2) | 13014 | 7.5% |
registrationDate
Date
| Distinct | 4859 |
|---|---|
| Distinct (%) | 26.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 145.2 KiB |
| Minimum | 1990-06-18 00:00:00 |
|---|---|
| Maximum | 2024-12-01 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
kilometers
Real number (ℝ)
| Distinct | 16693 |
|---|---|
| Distinct (%) | 89.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 177312.84 |
| Minimum | 160 |
|---|---|
| Maximum | 2795490 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 145.2 KiB |
Quantile statistics
| Minimum | 160 |
|---|---|
| 5-th percentile | 45698.7 |
| Q1 | 117883.5 |
| median | 169721 |
| Q3 | 226375.5 |
| 95-th percentile | 329084.1 |
| Maximum | 2795490 |
| Range | 2795330 |
| Interquartile range (IQR) | 108492 |
Descriptive statistics
| Standard deviation | 94019.896 |
|---|---|
| Coefficient of variation (CV) | 0.53024866 |
| Kurtosis | 87.895031 |
| Mean | 177312.84 |
| Median Absolute Deviation (MAD) | 54255 |
| Skewness | 4.0894384 |
| Sum | 3.293586 × 10^9 |
| Variance | 8.8397409 × 10^9 |
| Monotonicity | Not monotonic |
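Most of the derived statistics for kilometers can be checked against the primary ones. A minimal sketch using the standard definitions (range, interquartile range, coefficient of variation, variance as squared standard deviation, sum as mean times count), with inputs copied from the tables for this variable:

```python
# Values copied from the kilometers tables.
n = 18575
mean = 177312.84
std = 94019.896
minimum, maximum = 160, 2795490
q1, q3 = 117883.5, 226375.5

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range (IQR)
cv = std / mean                   # Coefficient of variation (CV)
variance = std ** 2               # Variance
total = mean * n                  # Sum, up to rounding of the reported mean

print(value_range, iqr)
print(f"CV = {cv:.6f}, variance = {variance:.4e}, sum = {total:.4e}")
```

Small differences in the last digits are expected because the report rounds the mean and standard deviation before display.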
| Value | Count | Frequency (%) |
| 160 | 28 | 0.2% |
| 198722 | 8 | < 0.1% |
| 121604 | 8 | < 0.1% |
| 308405 | 8 | < 0.1% |
| 46957 | 7 | < 0.1% |
| 245610 | 7 | < 0.1% |
| 214349 | 6 | < 0.1% |
| 159356 | 6 | < 0.1% |
| 104813 | 6 | < 0.1% |
| 161 | 6 | < 0.1% |
| Other values (16683) | 18485 |
| Value | Count | Frequency (%) |
| 160 | 28 | |
| 161 | 6 | < 0.1% |
| 216 | 1 | < 0.1% |
| 234 | 1 | < 0.1% |
| 655 | 1 | < 0.1% |
| 998 | 1 | < 0.1% |
| 1362 | 1 | < 0.1% |
| 1780 | 1 | < 0.1% |
| 1850 | 1 | < 0.1% |
| 2070 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 2795490 | 1 | |
| 2667431 | 1 | |
| 2340311 | 1 | |
| 2249130 | 1 | |
| 982968 | 1 | |
| 938048 | 1 | |
| 843017 | 1 | |
| 808613 | 2 | |
| 806029 | 1 | |
| 781195 | 1 |
colour
Categorical
| Distinct | 15 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 19 |
| Missing (%) | 0.1% |
| Memory size | 968.9 KiB |
| Value | Count |
|---|---|
| Grey | 8733 |
| Black | 5626 |
| Blue | 1655 |
| White | 1571 |
| Red | 372 |
| Other values (10) | 599 |
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 4.4042358 |
| Min length | 1 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Red |
|---|---|
| 2nd row | Red |
| 3rd row | Red |
| 4th row | Red |
| 5th row | Black |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Grey | 8733 | 47.0% |
| Black | 5626 | 30.3% |
| Blue | 1655 | 8.9% |
| White | 1571 | 8.5% |
| Red | 372 | 2.0% |
| Green | 246 | 1.3% |
| Brown | 237 | 1.3% |
| Beige | 40 | 0.2% |
| Yellow | 32 | 0.2% |
| Orange | 21 | 0.1% |
| Other values (5) | 23 | 0.1% |
| (Missing) | 19 | 0.1% |
Words
| Value | Count | Frequency (%) |
| grey | 8733 | |
| black | 5626 | |
| blue | 1655 | 8.9% |
| white | 1571 | 8.5% |
| red | 372 | 2.0% |
| green | 246 | 1.3% |
| brown | 237 | 1.3% |
| beige | 40 | 0.2% |
| yellow | 32 | 0.2% |
| orange | 21 | 0.1% |
| Other values (5) | 23 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 12977 | |
| r | 9247 | |
| G | 8980 | |
| y | 8733 | |
| B | 7562 | |
| l | 7363 | |
| a | 5651 | |
| c | 5626 | |
| k | 5626 | |
| u | 1659 | 2.0% |
| Other values (16) | 8301 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 81725 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 12977 | |
| r | 9247 | |
| G | 8980 | |
| y | 8733 | |
| B | 7562 | |
| l | 7363 | |
| a | 5651 | |
| c | 5626 | |
| k | 5626 | |
| u | 1659 | 2.0% |
| Other values (16) | 8301 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 81725 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 12977 | |
| r | 9247 | |
| G | 8980 | |
| y | 8733 | |
| B | 7562 | |
| l | 7363 | |
| a | 5651 | |
| c | 5626 | |
| k | 5626 | |
| u | 1659 | 2.0% |
| Other values (16) | 8301 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 81725 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 12977 | |
| r | 9247 | |
| G | 8980 | |
| y | 8733 | |
| B | 7562 | |
| l | 7363 | |
| a | 5651 | |
| c | 5626 | |
| k | 5626 | |
| u | 1659 | 2.0% |
| Other values (16) | 8301 |
aestheticGrade
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 990.0 KiB |
| Value | Count |
|---|---|
| Bad | 6575 |
| Very Bad | 6203 |
| Medium | 3623 |
| Good | 1435 |
| Very Good | 739 |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 5.570821 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Very Good |
|---|---|
| 2nd row | Bad |
| 3rd row | Bad |
| 4th row | Bad |
| 5th row | Bad |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Bad | 6575 | 35.4% |
| Very Bad | 6203 | 33.4% |
| Medium | 3623 | 19.5% |
| Good | 1435 | 7.7% |
| Very Good | 739 | 4.0% |
Words
| Value | Count | Frequency (%) |
| bad | 12778 | |
| very | 6942 | |
| medium | 3623 | 14.2% |
| good | 2174 | 8.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| d | 18575 | |
| B | 12778 | |
| a | 12778 | |
| e | 10565 | |
| V | 6942 | 6.7% |
| r | 6942 | 6.7% |
| y | 6942 | 6.7% |
| (space) | 6942 | 6.7% |
| o | 4348 | 4.2% |
| M | 3623 | 3.5% |
| Other values (4) | 13043 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 103478 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| d | 18575 | |
| B | 12778 | |
| a | 12778 | |
| e | 10565 | |
| V | 6942 | 6.7% |
| r | 6942 | 6.7% |
| y | 6942 | 6.7% |
| (space) | 6942 | 6.7% |
| o | 4348 | 4.2% |
| M | 3623 | 3.5% |
| Other values (4) | 13043 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 103478 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| d | 18575 | |
| B | 12778 | |
| a | 12778 | |
| e | 10565 | |
| V | 6942 | 6.7% |
| r | 6942 | 6.7% |
| y | 6942 | 6.7% |
| (space) | 6942 | 6.7% |
| o | 4348 | 4.2% |
| M | 3623 | 3.5% |
| Other values (4) | 13043 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 103478 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| d | 18575 | |
| B | 12778 | |
| a | 12778 | |
| e | 10565 | |
| V | 6942 | 6.7% |
| r | 6942 | 6.7% |
| y | 6942 | 6.7% |
| (space) | 6942 | 6.7% |
| o | 4348 | 4.2% |
| M | 3623 | 3.5% |
| Other values (4) | 13043 |
mechanicalGrade
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 978.3 KiB |
| Value | Count |
|---|---|
| Bad | 7267 |
| Medium | 4203 |
| Good | 3574 |
| Very Good | 1903 |
| Very Bad | 1628 |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 4.9241454 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Very Good |
|---|---|
| 2nd row | Good |
| 3rd row | Good |
| 4th row | Good |
| 5th row | Very Good |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Bad | 7267 | 39.1% |
| Medium | 4203 | 22.6% |
| Good | 3574 | 19.2% |
| Very Good | 1903 | 10.2% |
| Very Bad | 1628 | 8.8% |
Words
| Value | Count | Frequency (%) |
| bad | 8895 | |
| good | 5477 | |
| medium | 4203 | |
| very | 3531 | 16.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| d | 18575 | |
| o | 10954 | |
| a | 8895 | |
| B | 8895 | |
| e | 7734 | |
| G | 5477 | 6.0% |
| i | 4203 | 4.6% |
| M | 4203 | 4.6% |
| m | 4203 | 4.6% |
| u | 4203 | 4.6% |
| Other values (4) | 14124 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 91466 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| d | 18575 | |
| o | 10954 | |
| a | 8895 | |
| B | 8895 | |
| e | 7734 | |
| G | 5477 | 6.0% |
| i | 4203 | 4.6% |
| M | 4203 | 4.6% |
| m | 4203 | 4.6% |
| u | 4203 | 4.6% |
| Other values (4) | 14124 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 91466 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| d | 18575 | |
| o | 10954 | |
| a | 8895 | |
| B | 8895 | |
| e | 7734 | |
| G | 5477 | 6.0% |
| i | 4203 | 4.6% |
| M | 4203 | 4.6% |
| m | 4203 | 4.6% |
| u | 4203 | 4.6% |
| Other values (4) | 14124 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 91466 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| d | 18575 | |
| o | 10954 | |
| a | 8895 | |
| B | 8895 | |
| e | 7734 | |
| G | 5477 | 6.0% |
| i | 4203 | 4.6% |
| M | 4203 | 4.6% |
| m | 4203 | 4.6% |
| u | 4203 | 4.6% |
| Other values (4) | 14124 |
saleDate
Date
| Distinct | 521 |
|---|---|
| Distinct (%) | 2.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 145.2 KiB |
| Minimum | 1932-09-02 00:00:00 |
|---|---|
| Maximum | 2031-11-30 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
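The reported saleDate minimum (1932-09-02) precedes the earliest registrationDate in this report (1990-06-18), and the maximum (2031-11-30) lies after the date the analysis was run (2025-10-13). The sketch below shows one way to flag such values. The plausibility rule and the helper name `is_plausible_sale_date` are assumptions for illustration, not part of the report.

```python
from datetime import date

# Bounds copied from the report; using them as plausibility limits is an assumption.
earliest_registration = date(1990, 6, 18)   # registrationDate minimum
analysis_date = date(2025, 10, 13)          # date the analysis was run

def is_plausible_sale_date(d):
    """Hypothetical rule: no sale before the first registration or after the analysis."""
    return earliest_registration <= d <= analysis_date

# saleDate minimum and maximum as reported.
for d in (date(1932, 9, 2), date(2031, 11, 30)):
    status = "plausible" if is_plausible_sale_date(d) else "out of range"
    print(d.isoformat(), status)
```

Both extremes fall outside the assumed window, so the saleDate column may deserve a closer look even though the report counts no invalid dates.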
make
Categorical
| Distinct | 40 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1002.2 KiB |
| Value | Count |
|---|---|
| RENAULT | 2255 |
| VOLKSWAGEN | 1804 |
| PEUGEOT | 1740 |
| BMW | 1493 |
| OPEL | 1462 |
| Other values (35) | 9821 |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 6.2418304 |
| Min length | 2 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | HYUNDAI |
|---|---|
| 2nd row | NISSAN |
| 3rd row | NISSAN |
| 4th row | NISSAN |
| 5th row | VOLKSWAGEN |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| RENAULT | 2255 | 12.1% |
| VOLKSWAGEN | 1804 | 9.7% |
| PEUGEOT | 1740 | 9.4% |
| BMW | 1493 | 8.0% |
| OPEL | 1462 | 7.9% |
| MERCEDES-BENZ | 1020 | 5.5% |
| CITROEN | 993 | 5.3% |
| AUDI | 800 | 4.3% |
| FORD | 796 | 4.3% |
| FIAT | 785 | 4.2% |
| Other values (30) | 5427 | 29.2% |
Words
| Value | Count | Frequency (%) |
| renault | 2255 | |
| volkswagen | 1804 | 9.6% |
| peugeot | 1740 | 9.3% |
| bmw | 1493 | 8.0% |
| opel | 1462 | 7.8% |
| mercedes-benz | 1020 | 5.4% |
| citroen | 993 | 5.3% |
| audi | 800 | 4.3% |
| ford | 796 | 4.2% |
| fiat | 785 | 4.2% |
| Other values (31) | 5605 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 15300 | |
| O | 10821 | 9.3% |
| A | 10241 | 8.8% |
| T | 8384 | 7.2% |
| N | 8347 | 7.2% |
| L | 6657 | 5.7% |
| S | 5927 | 5.1% |
| R | 5772 | 5.0% |
| U | 5337 | 4.6% |
| I | 4817 | 4.2% |
| Other values (17) | 34339 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 115942 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| E | 15300 | |
| O | 10821 | 9.3% |
| A | 10241 | 8.8% |
| T | 8384 | 7.2% |
| N | 8347 | 7.2% |
| L | 6657 | 5.7% |
| S | 5927 | 5.1% |
| R | 5772 | 5.0% |
| U | 5337 | 4.6% |
| I | 4817 | 4.2% |
| Other values (17) | 34339 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 115942 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| E | 15300 | |
| O | 10821 | 9.3% |
| A | 10241 | 8.8% |
| T | 8384 | 7.2% |
| N | 8347 | 7.2% |
| L | 6657 | 5.7% |
| S | 5927 | 5.1% |
| R | 5772 | 5.0% |
| U | 5337 | 4.6% |
| I | 4817 | 4.2% |
| Other values (17) | 34339 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 115942 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| E | 15300 | |
| O | 10821 | 9.3% |
| A | 10241 | 8.8% |
| T | 8384 | 7.2% |
| N | 8347 | 7.2% |
| L | 6657 | 5.7% |
| S | 5927 | 5.1% |
| R | 5772 | 5.0% |
| U | 5337 | 4.6% |
| I | 4817 | 4.2% |
| Other values (17) | 34339 |
model
Text
| Distinct | 283 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1006.1 KiB |
Length
| Max length | 15 |
|---|---|
| Median length | 11 |
| Mean length | 4.9831494 |
| Min length | 1 |
Unique
| Unique | 35 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Kauai |
|---|---|
| 2nd row | Juke |
| 3rd row | Juke |
| 4th row | Juke |
| 5th row | Golf |
| Value | Count | Frequency (%) |
| mégan | 1148 | 5.4% |
| classe | 1016 | 4.8% |
| clio | 787 | 3.7% |
| golf | 696 | 3.3% |
| astra | 695 | 3.3% |
| c | 658 | 3.1% |
| serie-3 | 607 | 2.9% |
| polo | 602 | 2.8% |
| corsa | 570 | 2.7% |
| ibiza | 443 | 2.1% |
| Other values (274) | 14044 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8551 | 9.2% |
| o | 6201 | 6.7% |
| i | 6086 | 6.6% |
| s | 5937 | 6.4% |
| e | 5897 | 6.4% |
| C | 4830 | 5.2% |
| r | 4503 | 4.9% |
| l | 3773 | 4.1% |
| n | 3146 | 3.4% |
| 0 | 3032 | 3.3% |
| Other values (56) | 40606 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 92562 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 8551 | 9.2% |
| o | 6201 | 6.7% |
| i | 6086 | 6.6% |
| s | 5937 | 6.4% |
| e | 5897 | 6.4% |
| C | 4830 | 5.2% |
| r | 4503 | 4.9% |
| l | 3773 | 4.1% |
| n | 3146 | 3.4% |
| 0 | 3032 | 3.3% |
| Other values (56) | 40606 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 92562 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 8551 | 9.2% |
| o | 6201 | 6.7% |
| i | 6086 | 6.6% |
| s | 5937 | 6.4% |
| e | 5897 | 6.4% |
| C | 4830 | 5.2% |
| r | 4503 | 4.9% |
| l | 3773 | 4.1% |
| n | 3146 | 3.4% |
| 0 | 3032 | 3.3% |
| Other values (56) | 40606 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 92562 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 8551 | 9.2% |
| o | 6201 | 6.7% |
| i | 6086 | 6.6% |
| s | 5937 | 6.4% |
| e | 5897 | 6.4% |
| C | 4830 | 5.2% |
| r | 4503 | 4.9% |
| l | 3773 | 4.1% |
| n | 3146 | 3.4% |
| 0 | 3032 | 3.3% |
| Other values (56) | 40606 |
doorNumber
Categorical
High correlation Imbalance
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 907.1 KiB |
| Value | Count |
|---|---|
| 5 | 14601 |
| 3 | 1922 |
| 4 | 1882 |
| 2 | 155 |
| 6 | 15 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 5 |
|---|---|
| 2nd row | 5 |
| 3rd row | 5 |
| 4th row | 5 |
| 5th row | 5 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 5 | 14601 | 78.6% |
| 3 | 1922 | 10.3% |
| 4 | 1882 | 10.1% |
| 2 | 155 | 0.8% |
| 6 | 15 | 0.1% |
Words
| Value | Count | Frequency (%) |
| 5 | 14601 | |
| 3 | 1922 | 10.3% |
| 4 | 1882 | 10.1% |
| 2 | 155 | 0.8% |
| 6 | 15 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 14601 | |
| 3 | 1922 | 10.3% |
| 4 | 1882 | 10.1% |
| 2 | 155 | 0.8% |
| 6 | 15 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 18575 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 5 | 14601 | |
| 3 | 1922 | 10.3% |
| 4 | 1882 | 10.1% |
| 2 | 155 | 0.8% |
| 6 | 15 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 18575 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 5 | 14601 | |
| 3 | 1922 | 10.3% |
| 4 | 1882 | 10.1% |
| 2 | 155 | 0.8% |
| 6 | 15 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 18575 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 5 | 14601 | |
| 3 | 1922 | 10.3% |
| 4 | 1882 | 10.1% |
| 2 | 155 | 0.8% |
| 6 | 15 | 0.1% |
type
Categorical
High correlation
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.0 MiB |
| Value | Count |
|---|---|
| Hatchback | 10189 |
| Estate | 5970 |
| Sedan | 1834 |
| Coupe | 582 |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 7.5155316 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Hatchback |
|---|---|
| 2nd row | Hatchback |
| 3rd row | Hatchback |
| 4th row | Hatchback |
| 5th row | Estate |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Hatchback | 10189 | 54.9% |
| Estate | 5970 | 32.1% |
| Sedan | 1834 | 9.9% |
| Coupe | 582 | 3.1% |
Words
| Value | Count | Frequency (%) |
| hatchback | 10189 | |
| estate | 5970 | |
| sedan | 1834 | 9.9% |
| coupe | 582 | 3.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 28182 | |
| t | 22129 | |
| c | 20378 | |
| H | 10189 | 7.3% |
| h | 10189 | 7.3% |
| b | 10189 | 7.3% |
| k | 10189 | 7.3% |
| e | 8386 | 6.0% |
| E | 5970 | 4.3% |
| s | 5970 | 4.3% |
| Other values (7) | 7830 | 5.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 139601 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 28182 | |
| t | 22129 | |
| c | 20378 | |
| H | 10189 | 7.3% |
| h | 10189 | 7.3% |
| b | 10189 | 7.3% |
| k | 10189 | 7.3% |
| e | 8386 | 6.0% |
| E | 5970 | 4.3% |
| s | 5970 | 4.3% |
| Other values (7) | 7830 | 5.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 139601 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 28182 | |
| t | 22129 | |
| c | 20378 | |
| H | 10189 | 7.3% |
| h | 10189 | 7.3% |
| b | 10189 | 7.3% |
| k | 10189 | 7.3% |
| e | 8386 | 6.0% |
| E | 5970 | 4.3% |
| s | 5970 | 4.3% |
| Other values (7) | 7830 | 5.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 139601 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 28182 | |
| t | 22129 | |
| c | 20378 | |
| H | 10189 | 7.3% |
| h | 10189 | 7.3% |
| b | 10189 | 7.3% |
| k | 10189 | 7.3% |
| e | 8386 | 6.0% |
| E | 5970 | 4.3% |
| s | 5970 | 4.3% |
| Other values (7) | 7830 | 5.6% |
fuel
Categorical
High correlation
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 998.0 KiB |
| Value | Count |
|---|---|
| Diesel | 12466 |
| Petrol | 6016 |
| Electric | 93 |
Length
| Max length | 8 |
|---|---|
| Median length | 6 |
| Mean length | 6.0100135 |
| Min length | 6 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Petrol |
|---|---|
| 2nd row | Diesel |
| 3rd row | Diesel |
| 4th row | Diesel |
| 5th row | Diesel |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Diesel | 12466 | 67.1% |
| Petrol | 6016 | 32.4% |
| Electric | 93 | 0.5% |
Words
| Value | Count | Frequency (%) |
| diesel | 12466 | |
| petrol | 6016 | |
| electric | 93 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 31041 | |
| l | 18575 | |
| i | 12559 | |
| D | 12466 | |
| s | 12466 | |
| t | 6109 | 5.5% |
| r | 6109 | 5.5% |
| P | 6016 | 5.4% |
| o | 6016 | 5.4% |
| c | 186 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 111636 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 31041 | |
| l | 18575 | |
| i | 12559 | |
| D | 12466 | |
| s | 12466 | |
| t | 6109 | 5.5% |
| r | 6109 | 5.5% |
| P | 6016 | 5.4% |
| o | 6016 | 5.4% |
| c | 186 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 111636 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 31041 | |
| l | 18575 | |
| i | 12559 | |
| D | 12466 | |
| s | 12466 | |
| t | 6109 | 5.5% |
| r | 6109 | 5.5% |
| P | 6016 | 5.4% |
| o | 6016 | 5.4% |
| c | 186 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 111636 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 31041 | |
| l | 18575 | |
| i | 12559 | |
| D | 12466 | |
| s | 12466 | |
| t | 6109 | 5.5% |
| r | 6109 | 5.5% |
| P | 6016 | 5.4% |
| o | 6016 | 5.4% |
| c | 186 | 0.2% |
transmission
Categorical
Imbalance
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1003.8 KiB |
| Value | Count |
|---|---|
| Manual | 16532 |
| Automatic | 2043 |
Length
| Max length | 9 |
|---|---|
| Median length | 6 |
| Mean length | 6.3299596 |
| Min length | 6 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Manual |
|---|---|
| 2nd row | Manual |
| 3rd row | Manual |
| 4th row | Manual |
| 5th row | Automatic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Manual | 16532 | 89.0% |
| Automatic | 2043 | 11.0% |
Words
| Value | Count | Frequency (%) |
| manual | 16532 | |
| automatic | 2043 | 11.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 35107 | |
| u | 18575 | |
| M | 16532 | |
| n | 16532 | |
| l | 16532 | |
| t | 4086 | 3.5% |
| A | 2043 | 1.7% |
| o | 2043 | 1.7% |
| m | 2043 | 1.7% |
| i | 2043 | 1.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 117579 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 35107 | |
| u | 18575 | |
| M | 16532 | |
| n | 16532 | |
| l | 16532 | |
| t | 4086 | 3.5% |
| A | 2043 | 1.7% |
| o | 2043 | 1.7% |
| m | 2043 | 1.7% |
| i | 2043 | 1.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 117579 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 35107 | |
| u | 18575 | |
| M | 16532 | |
| n | 16532 | |
| l | 16532 | |
| t | 4086 | 3.5% |
| A | 2043 | 1.7% |
| o | 2043 | 1.7% |
| m | 2043 | 1.7% |
| i | 2043 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 117579 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 35107 | |
| u | 18575 | |
| M | 16532 | |
| n | 16532 | |
| l | 16532 | |
| t | 4086 | 3.5% |
| A | 2043 | 1.7% |
| o | 2043 | 1.7% |
| m | 2043 | 1.7% |
| i | 2043 | 1.7% |
yearIntroduced
Real number (ℝ)
High correlation
| Distinct | 34 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2006.9496 |
| Minimum | 1985 |
|---|---|
| Maximum | 2020 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 145.2 KiB |
Quantile statistics
| Minimum | 1985 |
|---|---|
| 5-th percentile | 1998 |
| Q1 | 2003 |
| median | 2007 |
| Q3 | 2011 |
| 95-th percentile | 2016 |
| Maximum | 2020 |
| Range | 35 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 5.192612 |
|---|---|
| Coefficient of variation (CV) | 0.0025873156 |
| Kurtosis | -0.34724081 |
| Mean | 2006.9496 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | -0.11338301 |
| Sum | 37279089 |
| Variance | 26.963219 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2008 | 1614 | 8.7% |
| 2005 | 1516 | 8.2% |
| 2009 | 1484 | 8.0% |
| 2007 | 1399 | 7.5% |
| 2004 | 1215 | 6.5% |
| 2012 | 1074 | 5.8% |
| 2006 | 1046 | 5.6% |
| 2003 | 1006 | 5.4% |
| 2001 | 904 | 4.9% |
| 2013 | 898 | 4.8% |
| Other values (24) | 6419 |
| Value | Count | Frequency (%) |
| 1985 | 5 | < 0.1% |
| 1987 | 1 | < 0.1% |
| 1989 | 2 | < 0.1% |
| 1990 | 1 | < 0.1% |
| 1991 | 14 | 0.1% |
| 1992 | 28 | 0.2% |
| 1993 | 57 | 0.3% |
| 1994 | 40 | 0.2% |
| 1995 | 49 | 0.3% |
| 1996 | 143 |
| Value | Count | Frequency (%) |
| 2020 | 2 | < 0.1% |
| 2019 | 25 | 0.1% |
| 2018 | 147 | 0.8% |
| 2017 | 237 | 1.3% |
| 2016 | 563 | |
| 2015 | 544 | |
| 2014 | 645 | |
| 2013 | 898 | |
| 2012 | 1074 | |
| 2011 | 547 |
cylinder
Real number (ℝ)
High correlation
| Distinct | 39 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.460511 |
| Minimum | 0 |
|---|---|
| Maximum | 50 |
| Zeros | 93 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 145.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 10 |
| Q1 | 12 |
| median | 15 |
| Q3 | 19 |
| 95-th percentile | 21 |
| Maximum | 50 |
| Range | 50 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 4.0429799 |
|---|---|
| Coefficient of variation (CV) | 0.26150363 |
| Kurtosis | 4.7118978 |
| Mean | 15.460511 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.86946656 |
| Sum | 287179 |
| Variance | 16.345687 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 16 | 3295 | |
| 12 | 2889 | |
| 20 | 2746 | |
| 15 | 2429 | |
| 14 | 2300 | |
| 11 | 952 | 5.1% |
| 19 | 903 | 4.9% |
| 10 | 864 | 4.7% |
| 21 | 439 | 2.4% |
| 17 | 316 | 1.7% |
| Other values (29) | 1442 |
| Value | Count | Frequency (%) |
| 0 | 93 | 0.5% |
| 6 | 10 | 0.1% |
| 7 | 31 | 0.2% |
| 8 | 127 | 0.7% |
| 9 | 151 | 0.8% |
| 10 | 864 | 4.7% |
| 11 | 952 | 5.1% |
| 12 | 2889 | |
| 13 | 250 | 1.3% |
| 14 | 2300 |
| Value | Count | Frequency (%) |
| 50 | 3 | |
| 48 | 1 | < 0.1% |
| 47 | 1 | < 0.1% |
| 44 | 7 | |
| 43 | 1 | < 0.1% |
| 42 | 3 | |
| 41 | 1 | < 0.1% |
| 40 | 3 | |
| 39 | 1 | < 0.1% |
| 36 | 1 | < 0.1% |
cubeCapacity
Real number (ℝ)
High correlation
| Distinct | 212 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1540.2247 |
| Minimum | 0 |
|---|---|
| Maximum | 4966 |
| Zeros | 93 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 145.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 999 |
| Q1 | 1248 |
| median | 1461 |
| Q3 | 1870 |
| 95-th percentile | 2143 |
| Maximum | 4966 |
| Range | 4966 |
| Interquartile range (IQR) | 622 |
Descriptive statistics
| Standard deviation | 397.66174 |
|---|---|
| Coefficient of variation (CV) | 0.25818424 |
| Kurtosis | 5.128262 |
| Mean | 1540.2247 |
| Median Absolute Deviation (MAD) | 215 |
| Skewness | 0.95676335 |
| Sum | 28609673 |
| Variance | 158134.86 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1461 | 2049 | 11.0% |
| 1560 | 1565 | 8.4% |
| 1598 | 1392 | 7.5% |
| 1995 | 1117 | 6.0% |
| 1248 | 954 | 5.1% |
| 1896 | 630 | 3.4% |
| 1968 | 623 | 3.4% |
| 1198 | 583 | 3.1% |
| 1398 | 582 | 3.1% |
| 1242 | 491 | 2.6% |
| Other values (202) | 8589 |
| Value | Count | Frequency (%) |
| 0 | 93 | |
| 599 | 10 | 0.1% |
| 698 | 31 | 0.2% |
| 796 | 24 | 0.1% |
| 799 | 103 | |
| 875 | 13 | 0.1% |
| 898 | 132 | |
| 899 | 6 | < 0.1% |
| 954 | 12 | 0.1% |
| 973 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 4966 | 3 | |
| 4806 | 1 | < 0.1% |
| 4663 | 1 | < 0.1% |
| 4398 | 3 | |
| 4395 | 4 | |
| 4293 | 1 | < 0.1% |
| 4196 | 2 | |
| 4172 | 1 | < 0.1% |
| 4134 | 1 | < 0.1% |
| 3997 | 2 |
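cylinder and cubeCapacity are flagged as highly correlated, and both contain exactly 93 zeros. Their smallest values are consistent with cylinder being cubeCapacity divided by 100 and rounded to the nearest integer. That relationship is inferred from the tables, not stated by the report. A minimal sketch that checks it against the reported counts:

```python
from collections import Counter

# Smallest cubeCapacity values and their counts, copied from the cubeCapacity section.
cube_capacity_counts = {0: 93, 599: 10, 698: 31, 796: 24, 799: 103,
                        875: 13, 898: 132, 899: 6}

# Assumption: cylinder = cubeCapacity / 100, rounded to the nearest integer.
derived = Counter()
for capacity, count in cube_capacity_counts.items():
    derived[round(capacity / 100)] += count

# Smallest cylinder values and their counts, copied from the cylinder section.
reported = {0: 93, 6: 10, 7: 31, 8: 127, 9: 151}

print(dict(derived))
print("matches report:", dict(derived) == reported)
```

If the relationship holds across the whole dataset, one of the two fields is redundant for modelling purposes.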
powerKW
Real number (ℝ)
High correlation
| Distinct | 137 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 77.892544 |
| Minimum | 29 |
|---|---|
| Maximum | 412 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 145.2 KiB |
Quantile statistics
| Minimum | 29 |
|---|---|
| 5-th percentile | 45 |
| Q1 | 55 |
| median | 74 |
| Q3 | 88 |
| 95-th percentile | 132 |
| Maximum | 412 |
| Range | 383 |
| Interquartile range (IQR) | 33 |
Descriptive statistics
| Standard deviation | 28.744424 |
|---|---|
| Coefficient of variation (CV) | 0.36902665 |
| Kurtosis | 8.4773965 |
| Mean | 77.892544 |
| Median Absolute Deviation (MAD) | 15 |
| Skewness | 1.9533692 |
| Sum | 1446854 |
| Variance | 826.24193 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 66 | 1905 | 10.3% |
| 55 | 1521 | 8.2% |
| 81 | 1381 | 7.4% |
| 77 | 860 | 4.6% |
| 51 | 847 | 4.6% |
| 50 | 698 | 3.8% |
| 80 | 663 | 3.6% |
| 70 | 647 | 3.5% |
| 85 | 601 | 3.2% |
| 110 | 595 | 3.2% |
| Other values (127) | 8857 |
| Value | Count | Frequency (%) |
| 29 | 6 | < 0.1% |
| 30 | 45 | 0.2% |
| 33 | 29 | 0.2% |
| 37 | 127 | |
| 38 | 24 | 0.1% |
| 39 | 40 | 0.2% |
| 40 | 72 | |
| 41 | 5 | < 0.1% |
| 42 | 4 | < 0.1% |
| 43 | 87 |
| Value | Count | Frequency (%) |
| 412 | 3 | |
| 320 | 1 | < 0.1% |
| 317 | 1 | < 0.1% |
| 315 | 2 | |
| 300 | 1 | < 0.1% |
| 294 | 1 | < 0.1% |
| 280 | 4 | |
| 272 | 1 | < 0.1% |
| 270 | 3 | |
| 258 | 1 | < 0.1% |
powerHP
Real number (ℝ)
High correlation
| Distinct | 160 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 105.99736 |
| Minimum | 39 |
|---|---|
| Maximum | 560 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 145.2 KiB |
Quantile statistics
| Minimum | 39 |
|---|---|
| 5-th percentile | 61 |
| Q1 | 75 |
| median | 100 |
| Q3 | 120 |
| 95-th percentile | 180 |
| Maximum | 560 |
| Range | 521 |
| Interquartile range (IQR) | 45 |
Descriptive statistics
| Standard deviation | 39.065234 |
|---|---|
| Coefficient of variation (CV) | 0.36854912 |
| Kurtosis | 8.4892257 |
| Mean | 105.99736 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | 1.9561873 |
| Sum | 1968901 |
| Variance | 1526.0925 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 90 | 1882 | 10.1% |
| 75 | 1534 | 8.3% |
| 110 | 1409 | 7.6% |
| 105 | 1049 | 5.6% |
| 95 | 648 | 3.5% |
| 109 | 638 | 3.4% |
| 68 | 620 | 3.3% |
| 150 | 587 | 3.2% |
| 70 | 582 | 3.1% |
| 115 | 554 | 3.0% |
| Other values (150) | 9072 | 48.8% |
| Value | Count | Frequency (%) |
|---|---|---|
| 39 | 5 | < 0.1% |
| 40 | 1 | < 0.1% |
| 41 | 45 | 0.2% |
| 45 | 29 | 0.2% |
| 50 | 127 | 0.7% |
| 51 | 23 | 0.1% |
| 52 | 1 | < 0.1% |
| 53 | 1 | < 0.1% |
| 54 | 80 | 0.4% |
| 55 | 36 | 0.2% |
| Value | Count | Frequency (%) |
|---|---|---|
| 560 | 3 | < 0.1% |
| 435 | 1 | < 0.1% |
| 431 | 1 | < 0.1% |
| 428 | 2 | < 0.1% |
| 407 | 1 | < 0.1% |
| 400 | 1 | < 0.1% |
| 381 | 4 | < 0.1% |
| 370 | 1 | < 0.1% |
| 367 | 3 | < 0.1% |
| 350 | 1 | < 0.1% |
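The powerKW and powerHP statistics track each other closely, which would be expected if powerHP is powerKW expressed in metric horsepower. The report does not state how powerHP was derived, so the conversion factor below (about 1.35962 metric hp per kW) is an assumption. This sketch compares it with the ratio of the two reported means and with a few (powerKW, powerHP) pairs taken from the sample rows at the end of the report.

```python
# Metric horsepower per kilowatt. Using this factor is an assumption about
# how powerHP relates to powerKW; the report itself does not say.
KW_TO_METRIC_HP = 1.35962

mean_kw = 77.892544   # reported Mean of powerKW
mean_hp = 105.99736   # reported Mean of powerHP

ratio = mean_hp / mean_kw
print(f"ratio of reported means: {ratio:.4f}")
print(f"difference from assumed factor: {abs(ratio - KW_TO_METRIC_HP):.4f}")

# (powerKW, powerHP) pairs copied from the sample rows of this report.
sample_pairs = [(88, 120), (81, 110), (77, 105), (96, 130), (66, 90)]
for kw, hp in sample_pairs:
    converted = kw * KW_TO_METRIC_HP
    print(f"{kw} kW -> {converted:.1f} hp (reported {hp})")
```

The converted values do not always round to exactly the reported powerHP (96 kW is one such case), so the two columns are near-duplicates rather than exact transforms of each other.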
targetPrice
Real number (ℝ)
High correlation
| Distinct | 407 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5960.217 |
| Minimum | 400 |
|---|---|
| Maximum | 60200 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 145.2 KiB |
Quantile statistics
| Minimum | 400 |
|---|---|
| 5-th percentile | 850 |
| Q1 | 2000 |
| median | 4100 |
| Q3 | 7700 |
| 95-th percentile | 17000 |
| Maximum | 60200 |
| Range | 59800 |
| Interquartile range (IQR) | 5700 |
Descriptive statistics
| Standard deviation | 6021.3456 |
|---|---|
| Coefficient of variation (CV) | 1.0102561 |
| Kurtosis | 10.856872 |
| Mean | 5960.217 |
| Median Absolute Deviation (MAD) | 2400 |
| Skewness | 2.7156971 |
| Sum | 1.1071103 × 10⁸ |
| Variance | 36256603 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 1800 | 402 | 2.2% |
| 1300 | 322 | 1.7% |
| 1700 | 301 | 1.6% |
| 1200 | 295 | 1.6% |
| 1600 | 294 | 1.6% |
| 1100 | 277 | 1.5% |
| 2300 | 272 | 1.5% |
| 1000 | 270 | 1.5% |
| 1900 | 268 | 1.4% |
| 2000 | 267 | 1.4% |
| Other values (397) | 15607 | 84.0% |
| Value | Count | Frequency (%) |
|---|---|---|
| 400 | 30 | 0.2% |
| 450 | 44 | 0.2% |
| 500 | 115 | 0.6% |
| 530 | 1 | < 0.1% |
| 550 | 33 | 0.2% |
| 600 | 136 | 0.7% |
| 650 | 39 | 0.2% |
| 700 | 202 | 1.1% |
| 750 | 46 | 0.2% |
| 800 | 244 | 1.3% |
| Value | Count | Frequency (%) |
|---|---|---|
| 60200 | 1 | < 0.1% |
| 57800 | 2 | < 0.1% |
| 57600 | 1 | < 0.1% |
| 57400 | 1 | < 0.1% |
| 57300 | 1 | < 0.1% |
| 56900 | 1 | < 0.1% |
| 56600 | 1 | < 0.1% |
| 52600 | 1 | < 0.1% |
| 52300 | 2 | < 0.1% |
| 50000 | 1 | < 0.1% |
Interactions
Correlations
| | aestheticGrade | colour | cubeCapacity | cylinder | doorNumber | fuel | kilometers | make | mechanicalGrade | powerHP | powerKW | targetPrice | transmission | type | yearIntroduced |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| aestheticGrade | 1.000 | 0.080 | 0.079 | 0.073 | 0.017 | 0.101 | 0.056 | 0.137 | 0.210 | 0.106 | 0.106 | 0.228 | 0.177 | 0.038 | 0.215 |
| colour | 0.080 | 1.000 | 0.066 | 0.071 | 0.091 | 0.124 | 0.051 | 0.101 | 0.079 | 0.054 | 0.054 | 0.064 | 0.058 | 0.113 | 0.116 |
| cubeCapacity | 0.079 | 0.066 | 1.000 | 0.993 | 0.218 | 0.793 | 0.319 | 0.383 | 0.084 | 0.846 | 0.844 | 0.358 | 0.464 | 0.353 | 0.085 |
| cylinder | 0.073 | 0.071 | 0.993 | 1.000 | 0.216 | 0.806 | 0.322 | 0.356 | 0.089 | 0.847 | 0.845 | 0.352 | 0.439 | 0.338 | 0.078 |
| doorNumber | 0.017 | 0.091 | 0.218 | 0.216 | 1.000 | 0.132 | 0.069 | 0.309 | 0.044 | 0.190 | 0.191 | 0.084 | 0.131 | 0.684 | 0.103 |
| fuel | 0.101 | 0.124 | 0.793 | 0.806 | 0.132 | 1.000 | 0.127 | 0.307 | 0.087 | 0.339 | 0.337 | 0.201 | 0.238 | 0.272 | 0.239 |
| kilometers | 0.056 | 0.051 | 0.319 | 0.322 | 0.069 | 0.127 | 1.000 | 0.078 | 0.146 | 0.151 | 0.151 | -0.442 | 0.000 | 0.106 | -0.459 |
| make | 0.137 | 0.101 | 0.383 | 0.356 | 0.309 | 0.307 | 0.078 | 1.000 | 0.156 | 0.331 | 0.329 | 0.289 | 0.466 | 0.451 | 0.162 |
| mechanicalGrade | 0.210 | 0.079 | 0.084 | 0.089 | 0.044 | 0.087 | 0.146 | 0.156 | 1.000 | 0.100 | 0.100 | 0.295 | 0.235 | 0.059 | 0.323 |
| powerHP | 0.106 | 0.054 | 0.846 | 0.847 | 0.190 | 0.339 | 0.151 | 0.331 | 0.100 | 1.000 | 0.999 | 0.531 | 0.496 | 0.311 | 0.287 |
| powerKW | 0.106 | 0.054 | 0.844 | 0.845 | 0.191 | 0.337 | 0.151 | 0.329 | 0.100 | 0.999 | 1.000 | 0.530 | 0.496 | 0.313 | 0.285 |
| targetPrice | 0.228 | 0.064 | 0.358 | 0.352 | 0.084 | 0.201 | -0.442 | 0.289 | 0.295 | 0.531 | 0.530 | 1.000 | 0.469 | 0.103 | 0.837 |
| transmission | 0.177 | 0.058 | 0.464 | 0.439 | 0.131 | 0.238 | 0.000 | 0.466 | 0.235 | 0.496 | 0.496 | 0.469 | 1.000 | 0.279 | 0.253 |
| type | 0.038 | 0.113 | 0.353 | 0.338 | 0.684 | 0.272 | 0.106 | 0.451 | 0.059 | 0.311 | 0.313 | 0.103 | 0.279 | 1.000 | 0.113 |
| yearIntroduced | 0.215 | 0.116 | 0.085 | 0.078 | 0.103 | 0.239 | -0.459 | 0.162 | 0.323 | 0.287 | 0.285 | 0.837 | 0.253 | 0.113 | 1.000 |
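The matrix above can be scanned programmatically to list which variable pairs are strongly associated. The sketch below is illustrative only: the coefficients are copied from the table, and the 0.5 cutoff is an assumption (the report does not state the threshold behind its High correlation alerts, although this value happens to reproduce the nine variables flagged in the overview).

```python
names = ["aestheticGrade", "colour", "cubeCapacity", "cylinder", "doorNumber",
         "fuel", "kilometers", "make", "mechanicalGrade", "powerHP", "powerKW",
         "targetPrice", "transmission", "type", "yearIntroduced"]

# Rows copied from the correlation table, in the same order as `names`.
matrix = [
    [1.000, 0.080, 0.079, 0.073, 0.017, 0.101, 0.056, 0.137, 0.210, 0.106, 0.106, 0.228, 0.177, 0.038, 0.215],
    [0.080, 1.000, 0.066, 0.071, 0.091, 0.124, 0.051, 0.101, 0.079, 0.054, 0.054, 0.064, 0.058, 0.113, 0.116],
    [0.079, 0.066, 1.000, 0.993, 0.218, 0.793, 0.319, 0.383, 0.084, 0.846, 0.844, 0.358, 0.464, 0.353, 0.085],
    [0.073, 0.071, 0.993, 1.000, 0.216, 0.806, 0.322, 0.356, 0.089, 0.847, 0.845, 0.352, 0.439, 0.338, 0.078],
    [0.017, 0.091, 0.218, 0.216, 1.000, 0.132, 0.069, 0.309, 0.044, 0.190, 0.191, 0.084, 0.131, 0.684, 0.103],
    [0.101, 0.124, 0.793, 0.806, 0.132, 1.000, 0.127, 0.307, 0.087, 0.339, 0.337, 0.201, 0.238, 0.272, 0.239],
    [0.056, 0.051, 0.319, 0.322, 0.069, 0.127, 1.000, 0.078, 0.146, 0.151, 0.151, -0.442, 0.000, 0.106, -0.459],
    [0.137, 0.101, 0.383, 0.356, 0.309, 0.307, 0.078, 1.000, 0.156, 0.331, 0.329, 0.289, 0.466, 0.451, 0.162],
    [0.210, 0.079, 0.084, 0.089, 0.044, 0.087, 0.146, 0.156, 1.000, 0.100, 0.100, 0.295, 0.235, 0.059, 0.323],
    [0.106, 0.054, 0.846, 0.847, 0.190, 0.339, 0.151, 0.331, 0.100, 1.000, 0.999, 0.531, 0.496, 0.311, 0.287],
    [0.106, 0.054, 0.844, 0.845, 0.191, 0.337, 0.151, 0.329, 0.100, 0.999, 1.000, 0.530, 0.496, 0.313, 0.285],
    [0.228, 0.064, 0.358, 0.352, 0.084, 0.201, -0.442, 0.289, 0.295, 0.531, 0.530, 1.000, 0.469, 0.103, 0.837],
    [0.177, 0.058, 0.464, 0.439, 0.131, 0.238, 0.000, 0.466, 0.235, 0.496, 0.496, 0.469, 1.000, 0.279, 0.253],
    [0.038, 0.113, 0.353, 0.338, 0.684, 0.272, 0.106, 0.451, 0.059, 0.311, 0.313, 0.103, 0.279, 1.000, 0.113],
    [0.215, 0.116, 0.085, 0.078, 0.103, 0.239, -0.459, 0.162, 0.323, 0.287, 0.285, 0.837, 0.253, 0.113, 1.000],
]

THRESHOLD = 0.5  # assumed cutoff, not stated in the report

# For each variable, collect the other variables whose |coefficient| meets the cutoff.
high = {}
for i, row_name in enumerate(names):
    partners = [names[j] for j, r in enumerate(matrix[i])
                if j != i and abs(r) >= THRESHOLD]
    if partners:
        high[row_name] = partners

for name, partners in high.items():
    print(f"{name}: {', '.join(partners)}")
```

With this cutoff, kilometers is not flagged even though its coefficients with targetPrice (-0.442) and yearIntroduced (-0.459) are the only negative values in the table.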
Missing values
Sample
| | vehicleID | registrationDate | kilometers | colour | aestheticGrade | mechanicalGrade | saleDate | make | model | doorNumber | type | fuel | transmission | yearIntroduced | cylinder | cubeCapacity | powerKW | powerHP | targetPrice |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | V_1232 | 23/01/2022 | 8984 | Red | Very Good | Very Good | 07/11/2022 | HYUNDAI | Kauai | 5 | Hatchback | Petrol | Manual | 2017 | 10 | 998 | 88 | 120 | 17300 |
| 1 | V_1233 | 23/01/2017 | 127566 | Red | Bad | Good | 20/08/2022 | NISSAN | Juke | 5 | Hatchback | Diesel | Manual | 2010 | 15 | 1461 | 81 | 110 | 8800 |
| 2 | V_1234 | 23/01/2017 | 127566 | Red | Bad | Good | 13/09/2022 | NISSAN | Juke | 5 | Hatchback | Diesel | Manual | 2010 | 15 | 1461 | 81 | 110 | 9600 |
| 3 | V_1235 | 23/01/2017 | 127566 | Red | Bad | Good | 16/08/2022 | NISSAN | Juke | 5 | Hatchback | Diesel | Manual | 2010 | 15 | 1461 | 81 | 110 | 8500 |
| 4 | V_1236 | 23/01/2016 | 108759 | Black | Bad | Very Good | 09/05/2022 | VOLKSWAGEN | Golf | 5 | Estate | Diesel | Automatic | 2013 | 16 | 1598 | 77 | 105 | 11300 |
| 5 | V_1237 | 23/01/2016 | 185798 | Black | Bad | Medium | 01/06/2022 | NISSAN | Qashqai | 5 | Estate | Diesel | Manual | 2009 | 16 | 1598 | 96 | 130 | 12200 |
| 6 | V_1238 | 23/01/2016 | 185798 | Black | Bad | Medium | 08/06/2022 | NISSAN | Qashqai | 5 | Estate | Diesel | Manual | 2009 | 16 | 1598 | 96 | 130 | 11700 |
| 7 | V_1239 | 23/01/2013 | 179959 | Black | Bad | Medium | 18/05/2022 | RENAULT | Clio | 5 | Estate | Diesel | Manual | 2009 | 15 | 1461 | 66 | 90 | 3900 |
| 8 | V_12310 | 23/01/2013 | 195703 | Grey | Medium | Bad | 16/11/2022 | FORD | Focus | 5 | Estate | Diesel | Manual | 2008 | 16 | 1560 | 80 | 109 | 2700 |
| 9 | V_12311 | 23/01/2013 | 197582 | Black | Very Bad | Bad | 28/08/2022 | ALFA ROMEO | Giulietta | 5 | Hatchback | Diesel | Manual | 2010 | 16 | 1598 | 77 | 105 | 4900 |
| | vehicleID | registrationDate | kilometers | colour | aestheticGrade | mechanicalGrade | saleDate | make | model | doorNumber | type | fuel | transmission | yearIntroduced | cylinder | cubeCapacity | powerKW | powerHP | targetPrice |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 18565 | V_12318567 | 14/09/2016 | 71420 | Grey | Bad | Medium | 11/10/2023 | TOYOTA | Auris | 5 | Estate | Diesel | Manual | 2013 | 14 | 1364 | 66 | 90 | 10500 |
| 18566 | V_12318568 | 22/06/2008 | 253884 | Black | Very Bad | Very Bad | 18/10/2023 | MERCEDES-BENZ | Classe C | 5 | Estate | Diesel | Manual | 2001 | 21 | 2148 | 105 | 143 | 3500 |
| 18567 | V_12318569 | 10/09/2005 | 135036 | Grey | Very Bad | Good | 16/10/2023 | AUDI | A4 | 5 | Estate | Diesel | Automatic | 2001 | 19 | 1896 | 96 | 130 | 5300 |
| 18568 | V_12318570 | 19/08/2014 | 197823 | Black | Bad | Bad | 11/10/2023 | NISSAN | Qashqai | 5 | Estate | Diesel | Manual | 2009 | 15 | 1461 | 81 | 109 | 8700 |
| 18569 | V_12318571 | 18/12/2010 | 190766 | Grey | Bad | Bad | 11/10/2023 | AUDI | A3 | 5 | Estate | Diesel | Manual | 2008 | 20 | 1968 | 103 | 140 | 6700 |
| 18570 | V_12318572 | 22/08/2005 | 190913 | Black | Bad | Bad | 14/10/2023 | AUDI | A4 | 5 | Estate | Petrol | Manual | 2001 | 16 | 1595 | 75 | 102 | 2500 |
| 18571 | V_12318573 | 25/08/2020 | 278484 | Grey | Medium | Medium | 11/10/2023 | RENAULT | Mégan | 5 | Estate | Diesel | Manual | 2016 | 15 | 1461 | 81 | 110 | 8800 |
| 18572 | V_12318574 | 23/07/2017 | 157217 | Grey | Very Bad | Good | 11/10/2023 | SEAT | Leon | 5 | Estate | Diesel | Manual | 2013 | 16 | 1598 | 81 | 110 | 9700 |
| 18573 | V_12318575 | 21/06/2017 | 156823 | Grey | Bad | Very Good | 11/10/2023 | AUDI | A6 | 5 | Estate | Diesel | Automatic | 2014 | 20 | 1968 | 140 | 190 | 18900 |
| 18574 | V_12318576 | 20/11/2013 | 131553 | Black | Very Bad | Medium | 16/10/2023 | SEAT | Ibiza | 5 | Estate | Petrol | Manual | 2010 | 12 | 1198 | 51 | 70 | 4300 |
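The registrationDate and saleDate values in the sample rows appear to be day-first (for example 23/01/2022 cannot be month-first), so a value such as 07/11/2022 would be misread by a month-first parser. The sketch below assumes the `%d/%m/%Y` layout for both columns and uses a hypothetical helper, `days_to_sale`, to compute the time between registration and sale for one sample row.

```python
from datetime import datetime

# Day-first layout, inferred from sample values; this is an assumption,
# since the report does not document the date format.
DATE_FORMAT = "%d/%m/%Y"


def days_to_sale(registration: str, sale: str) -> int:
    """Return the number of days between registration and sale."""
    registered = datetime.strptime(registration, DATE_FORMAT)
    sold = datetime.strptime(sale, DATE_FORMAT)
    return (sold - registered).days


# Row 0 of the sample: registered 23/01/2022, sold 07/11/2022.
print(days_to_sale("23/01/2022", "07/11/2022"))
```

Passing an explicit format, rather than relying on automatic inference, makes a wrongly formatted value fail loudly instead of being silently swapped.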